Outlier Detection under Interval and Fuzzy Uncertainty: Algorithmic Solvability and Computational Complexity
نویسندگان
چکیده
In many application areas, it is important to detect outliers. Traditional engineering approach to outlier detection is that we start with some “normal” values , compute the sample average , the sample standard variation , and then mark a value as an outlier if is outside the -sigma interval (for some pre-selected parameter ). In real life, we often have only interval ranges for the normal values . In this case, we only have intervals of possible values for the bounds and ! . We can therefore identify outliers as values that are outside all -sigma intervals. In this paper, we analyze the computational complexity of these outlier detection problems, and provide efficient algorithms that solve some of these problems (under reasonable conditions). We also provide algorithms that estimate the degree of “outlier-ness” of a given value – measured as the largest value for which is outside the corresponding " -sigma interval.
منابع مشابه
Outlier Detection under Interval Uncertainty: Algorithmic Solvability and Computational Complexity
In many application areas, it is important to detect outliers. The traditional engineering approach to outlier detection is that we start with some “normal” values x1, . . . , xn, compute the sample average E, the sample standard variation σ, and then mark a value x as an outlier if x is outside the k0-sigma interval [E − k0 · σ, E + k0 · σ] (for some pre-selected parameter k0). In real life, w...
متن کاملEstimating information amount under uncertainty: algorithmic solvability and computational complexity
Sometimes, we know the probability of different values of the estimation error ∆x def = e x− x, sometimes, we only know the interval of possible values of ∆x, sometimes, we have interval bounds on the cdf of ∆x. To compare different measuring instruments, it is desirable to know which of them brings more information – i.e., it is desirable to gauge the amount of information. For probabilistic u...
متن کاملCombining Interval, Probabilistic, and Fuzzy Uncertainty: Foundations, Algorithms, Challenges – An Overview
Probabilistic and . . . Interval . . . Why Not Maximum . . . Chip Design: Case . . . General Approach: . . . Interval Approach: . . . Extension of Interval . . . Successes (cont-d) Challenges Problem Main Idea: Use Moments Formulation of the . . . Result Case Study: . . . General Problem Case Study: Detecting . . . Outlier Detection . . . Outlier Detection . . . Fuzzy Uncertainty: In . . . Ackn...
متن کاملUNCERTAINTY DATA CREATING INTERVAL-VALUED FUZZY RELATION IN DECISION MAKING MODEL WITH GENERAL PREFERENCE STRUCTURE
The paper introduces a new approach to preference structure, where from a weak preference relation derive the following relations:strict preference, indifference and incomparability, which by aggregations and negations are created and examined. We decomposing a preference relation into a strict preference, anindifference, and an incomparability relation.This approach allows one to quantify diff...
متن کاملA New Version of Earned Value Analysis for Mega Projects Under Interval-valued Fuzzy Environment
The earned value technique is a crucial and important technique in analysis and control the performance and progress of mega projects by integrating three elements of them, i.e., time, cost and scope. This paper proposes a new version of earned value analysis (EVA) to handle uncertainty in mega projects under interval-valued fuzzy (IVF)-environment. Considering that uncertainty is very common i...
متن کامل